Spoken Dialogue System Using Prosody as Para-Linguistic Information

نویسندگان

  • Shinya Fujie
  • Daizo Yagi
  • Yosuke Matsusaka
  • Hideaki Kikuchi
  • Tetsunori Kobayashi
چکیده

An attitude recognizer of a speaker which uses prosodic features of speech is proposed and it is successfully applied to the dialogue system aiming at agreement formation. We use not only linguistic information but also some sorts of additional information supporting linguistic information in our human communication. In agreement formation dialogues, we are often required to express our attitude (positive or negative) to conversational partners’ proposals. We sometimes reply explicitly in linguistic information. We sometimes reply information ambiguously. However, even in the ambiguous case, we implicitly express our attitude using prosodic information. By realizing the abilities of catching these nuances, the dialogue system can be more sophisticated. In this paper, we implemented an attitude recognizer based on the GMM using prosodic feature parameters. The performance of the system is comparable to the human ability. We also realized a proto-type of spoken dialogue system using the recognizer. We show how these abilities contribute to efficient conversation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prosody based attitude recognition with feature selection and its application to spoken dialog system as para-linguistic information

In this paper, prosody-based attitude recognition and its application to a spoken dialog system are proposed. Paralinguistic information plays a important role in the human communication. We aimed to recognize the user’s attitude by prosody, and apply it to a spoken dialog system as para-linguistic information. In order to find important features to recognize the attitude from automatically ext...

متن کامل

A hybrid approach to spoken dialogue understanding: prosody, statistics and partial parsing

Linguistic processing in spoken dialogue systems has to be robust against a large number of phenomena such as recognizer errors, spontaneous speech phenomena and out-of-vocabulary (OOV) words. A commonly used solution to this problem is partial parsing, that aims at detecting only parts of sentences/utterances that are vital for the respective task of the parser. In our paper we present a frame...

متن کامل

A framework of reply speech generation for concept-to-speech conversion in spoken dialogue systems

Due to recent advancements in speech technologies, a large number of spoken dialogue systems have been constructed. However, since most of them adopt existing text-to-speech synthesizers, it is rather difficult to reflect the linguistic information obtained during the reply sentence generation well in output speech. A framework is necessary for correctly reflecting higher-level linguistic infor...

متن کامل

Personal Statement for Gina - Anne Levow

My research is strongly interdisciplinary, drawing on methods from computer science to investigate fundamental linguistic questions and applying findings from linguistics to develop improved techniques for automatic computational understanding of natural language. My research lies at the intersection of computational linguistics, natural language processing (NLP), and spoken language processing...

متن کامل

Message-To-Speech: High Quality Speech Generation For Messaging And Dialogue Systems

In this paper, we present a Message-toSpeech (MTS) system that offers the linguistic flexibility desired for spoken dialogue and message generating systems. The use of prosody transplantation and special purpose prosody models results in highly natural prosody for the synthesised speech.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004